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TITLE 

Method and computer program for digital image processing 
for two-dimensional electrophoresis . 

DESCRIPTION 

FIELD OF THE INVENTION 

The present invention relates to methods and devices for digital image processing of 
biological sample separations. More specifically It relates to techniques for processing 
digital image data for a two-dimensional array of sample substance spots and marker 
substance spots in an electrophoresis gel by image normalisation. 

BACKGROUND OF THE INVENTION 

In a cell obtainable from e.g. a cell culture or tissue sample an existing pool of 
proteins, a proteome, exists as a part of biological processes and functions. In many 
different clinical, diagnostical and analytical situations it is desirable to gain knowledge 
about the identity and amount of a protein or a group of proteins at a certain point in 
time, and also of certain changes over time. Certain biological processes are defined 
by changes in morphology and physiology due to changes in the expression, i.e. the 
20 protein level, of particular genes. Also, developmental stages of cells can be defined 
and monitored by their global pattern of gene expression and the progressive changes 
that occur over time of particular genes or groups of proteins. Even more, as a 
response to treatment of cells with chemical factors, e.g. drugs, hormones, nutrient 
factors, environmental factors and other growth condition factors, specific proteins of 
25 groups of proteins can change their expression and as such allow for monitoring or 
identifying a response or treatment. Information of this type can be used to e.g. 
detect, identify and classify tumours in terms of malignancy, evaluation of therapies, 
adjustment of therapies as well as diagnosis and prognosis. Also, changes in proteins 
due to mutations, cleavage, phosphorylation or glycosylation can be obtained. Further 
30 identification of unknown proteins can e.g. be done by mass-spectrometry of 
unidentified sample protein spots on the gel. 

A general method used for detecting, identifying, monitoring and quantifying the 
compositions of complex biological mixtures is two-dimensional gel electrophoresis. In 
35 such a method the separation performed in two dimensions enables detection and 
identification of a large number of components that would not be separable and 
distinguishable in a one-dimensional separation, such as a linear separation. A 
frequently used method for monitoring the protein expression by 2D (two- 
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dimensional) gel electrophoresis comprises the feature of analysing a set of multiple 
gels in one run. 

A typical prior art method is detection and identification of sample spots in a 2D 
5 polyacrylamide gel. By using large gels, the number of proteins to be detected and 
identified can be several hundred up to about 10 000 in one gel. Each spot is detected 
by means of a signal derived from the individual spot. In a typical gel size of 24x18 
cm, several thousands of sample protein spots are detected. Normally, about 10% of 
the total number of sample spots need to be selected as so-called landmarks. After a 

10 separation process has been performed, the spatial relation between these landmarks, 
or markers, and the sample spots is measured and used for the purpose of 
identification of the sample substances, in a manner well known to the skilled person. 
This further involves a few hundred manually selected landmarks in each gel. The 
process of detection, identification and analysis of the separate components in a two- 

15 dimensional electrophoresis gel is a complex and difficult task, and often involves 
tedious and time consuming manual steps demanding months of experience to 
perform. Several attempts have been made to develop means and methods to 
simplify, standardize and automise the procedure as described in Smilansky et al., 
2001, Electrophoresis 22: 1616-1626; Thompson et al., 1998, in 20 th Annual 

20 International Conference of the IEEE Engineering in Medicine and Biology Society, Vol. 
20: 1060-1063; Wanatabe, et al., 1998, in 20 th Annual International Conference of the 
IEEE Engineering in Medicine and Biology Society, Vol. 20:804-808. 



The manner in which the sample protein spots are distributed across the two- 
25 dimensional gel-electrophoresis depend on the separation parameters used in the first 
and second dimension in the two-dimensional electrophoresis procedure. Two 
parameters commonly used are isoelectric point and molecular size (Gorg et al., 2000, 
Electrophoresis 21:1037-1053). Furthermore, various ways of obtaining detectable 
signals from the spots have been used, such as post-gel colouring techniques, e.g. 
30 silver staining as described in Sinha et al., 2001, Proteomics 1:853-840; Shevchenko 
et al., 1996, Anal. Chem. 68:850-858. Other techniques that are used include optical 
density, radioactive emission, fluorescence emission, and colorimetric signals (Pavon 
et al., 1999, J Interferone Cytokine Res.19: 589-599; Steinberg et al., 2001, 
Proteomics, 1:841-855; Herich et al., 2001, Biotechniques, 31:146-149; Kemper at 
35 al., 2001, Electrophoresis, 22:970-976; Lauber et al., 2001, Electrophoresis 22:919- 
932; Berggren et al., 2000, Electrophoresis 21:2509-2521; Steinberg et al., 2000, 
Electrophoresis 21:486-496). Detection of the spots is commonly achieved by the use 
of imaging devices that convert the detected signals into digital data and store the 
data as information on computer storage media. 
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One of the key features in identifying the sample spots is the assignment and use of 
reference spots known as "landmarks". The landmarks are actual protein spots that 
are manually selected by the user, i.e. manual landmarking. Computer software is 
further used for processing the manually selected spots in order to automatically 
5 detect and identify the same spots in all of the member gels in one matchset. 

These user-selected landmarks are preferably relatively few in number compared to 
the total number of sample spots in an individual electropherogram. The selection 
criteria of the manual landmarks are that the spots should be well-resolved, that they 
are well isolated from other spots, and that they appear in all the gels of the matchset. 

10 The number of spots to be assigned as landmarks must be large enough that all of the 
remaining protein spots among the various gels will be successfully matched by the 
automatic processing. The function of the landmarks is to serve as guideposts in the 
gel-to-gel comparison, thereby aiming at reducing and compensating differences and 
distortions among the member gels in the matchset to assure that there will be a 

15 proper correspondence of protein spots among different gels in the matchset. 

Further, by using known sample proteins manually chosen as landmarks in the 2-DE 
one has to consider only proteins that exists in all gels in one matchset. Such protein 
must exist in a relatively large amount to be readily repeatable and detectable in all of 
20 the gels. Due to this, the spots chosen as protein landmarks are often large and blob- 
like. This gives less defined landmarks that are difficult to position accurately, i.e. to 
find the exact centre of the landmark, which is a source of inaccuracy on the analysis 
process of the sample protein spots. 

25 The process of selecting and marking spots to be used as landmarks is slow and 
tedious, and is one of the limiting factors in two-dimensional gel electrophoresis 
analysis. Normally, since about 10% of the total number of protein spots needs to be 
selected as protein landmarks to make the image analysis algorithm work correctly, 
several hours must be used for each run to pick said landmarks manually. Also, in 

30 addition to assigning the above described selection criteria for manually chosen 

landmarks, the time involved in making the selections and making the spots adds to 
the cost and time involved in performing the analysis. Even further, the level of user 
involvement raises question regarding reproducibility and reliability. 

35 Being a generally known and widely used technique, the state of the art comprises 
several disclosures relating to two-dimensional electrophoresis. 

US 5 139 630 teaches a method for detecting and identifying protein species in a 
sample by capillary zone electrophoresis by the addition of at least two external 
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markers, one being an ionic species and one being a neutral charged species. This 
method is only applicable for capillary electrophoresis, i.e. where the separation is in 
one dimension, here disclosed for charge densities, and not applicable for a separation 
using more than one dimension as in e.g. a two-dimensional gel electrophoresis. 

WO 01/07920 discloses a method for automated landmarking for two-dimensional gel 
electrophoresis by the addition of marker proteins to the sample proteins. However, 
the use of proteins as external landmarks has several disadvantages, such as high 
production costs and short shelf life of the final protein product. Further, the use of 
protein landmarks, means that special means have to be taken to make such proteins 
visualized in contrast to proteins present in a sample material. 

In the light of the aforementioned problems it is thus an object of the invention to 
provide means and methods for reducing time and cost relating to the process of 
electrophoresis. An aspect of this object is to increase the reproducibility and reliability 
of two-dimensional gel electrophoresis used for separation, detection, identification 
and quantification of proteins, and to provide a solution overcoming the problems 
associated with the prior art means and methods. 



SUMMARY OF THE INVENTION 

In order to fulfil the objects above, the present invention provides means and methods 
for using detected signals from landmarks, herein also called markers, having known 
properties in order to transform a detected 2D gel image to a form in which it is 
comparable to other images of 2D gels. The landmarks may be selected from the 
sample, provided it is known that they will be present. Alternatively, a set of so-called 
external landmarks is used, which external landmarks have been added to the sample 
beforehand. The term external is simply used to point out that the landmark 
substances used are not selected from the sample substances to be determined, but 
are included in addition to the sample substances to be determined and quantified, 
prior to the electrophoresis process. Hence, these external landmarks are subjected to 
the same and separation process as the samples substances to be determined. 
Preferably, such external landmarks are artificially created, such that their properties 
will give rise to a desired spatial distribution of landmark spots in the gel after an 
electrophoresis run. 

According to a first aspect of the invention it relates ethod for processing digital image 
data for two-dimensional arrays of sample substance spots and marker substance 
spots in a plurality of electrophoresis gels by using marker substances having 
predefined properties, comprising the steps of: 
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- generating marker images represented by marker image data sets comprising 
coordinate data and signal values corresponding to detected positions and signal 
values of said marker substance spots in said arrays; 

- processing said marker images to determine a mathematical relation between said 

5 marker signals in said marker images such that said respective signals are mapped on 
each other; 

- generating sample images represented by sample image data sets comprising 
coordinate data and signal values corresponding to detected positions and signal 
values of said sample substance spots in said arrays; 

10 - transforming said sample image signals according to said mathematical relation 

In a preferred embodiment the invention is a method for the detection, matching and 
quantification of sample substances in a plurality of electrophoresis gels by using 
external marker substances having predefined properties, the method comprising the 
15 steps of: 

- combining each of said gels with said sample substances and a plurality of said 
external marker substances each having a known identity and known characteristics 
that affect its migration in two-dimensional electrophoresis, which said external 
marker substances are selected such that they are distinguishable from said sample 

20 substances; 

- performing two-dimensional electrophoresis on said gels thus combined with said 
sample substances and said external marker substances to form a two-dimensional 
array, each array including said external marker substance spots corresponding to said 
external marker substances and sample spots corresponding to said sample 

25 substances; 

- scanning each array to form a first image of signals corresponding to said external 
marker substance spots and a second image of signals corresponding to said sample 
substance spots, the signals of said first image being distinguishable from those of 
said second image; 

30 - processing said first image together with the known identity and known 

characteristics of said external marker to determine a mathematical relation between 
said first image signals and signals representing said known identity and said known 
characteristics of said external marker substances such that said respective signals are 
mapped on each other; 

35 - transforming said second image signals dependent on said mathematical relation 
such that the external marker substance spots are positioned as external marker 
reference spots on the co-ordinates in the two-dimensional array that correspond to 
their properties in said electrophoresis gel; 
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- determining the properties of said sample spots dependent on said external marker 
reference spots in said transformed second image. 

In another preferred embodiment the invention is a method for analysing samples 
5 substances by comparison between a first electrophoresis gel and a second 

electrophoresis gel by using external marker substances having predefined properties, 
the method comprising the steps of: 

- combining each of said gels with said sample substances and a plurality of said 
external marker substances each having a known identity and known characteristics 

10 that affect its migration in two-dimensional electrophoresis, which said external 

marker substances are selected such that they are distinguishable from said sample 
substances; 

- performing two-dimensional electrophoresis on said gels thus combined with said 
sample substances and said external marker substances to form a two-dimensional 

15 array, each array including said external marker substance spots corresponding to said 
external marker substances and sample spots corresponding to said sample 
substances; 

-scanning each array to form a marker image of signals corresponding to said external 
marker substance spots and a sample image of signals corresponding to said sample 
20 substance spots, the signals of said marker image being distinguishable from those of 
said sample image; 

- determining a mathematical relation between the signals of the respective marker 
image of said sample gels, such that the signals corresponding to spots of the same 
marker substance in the marker images of the first and the second electrophoresis gel 

25 are mapped on each other; 

- transforming the sample image of the first gel dependent on said mathematical 
relation, such that the sample images of both gels relate to the same external marker 
substance positions; and 

- determining differences between the two electrophoresis gels by directly comparing 
30 the transformed sample image of the first gel with the sample image of the second 

gel. 

In a further preferred embodiment the invention is a method for analysing samples 
substances by comparison between a first electrophoresis sample gel and a second 
35 electrophoresis gel by using external marker substances having predefined properties, 
the method comprising the steps of: 

- combining each of said sample gels with a common set of external marker 
substances each having known properties affecting its migration in two-dimensional 
electrophoresis; 
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- performing two-dimensional electrophoresis on each of said sample gels thus 
combined with said external marker substances to form a two-dimensional array, each 
array including external marker substance spots corresponding to said external marker 
substances and sample spots corresponding to said sample substances; 

5 -scanning each array to form a marker image of signals corresponding to said external 
marker substance spots and a sample image of signals corresponding to said sample 
substances spots, the signals of said marker image being distinguishable from those of 
said sample image; 

- applying a mathematical relation between the signals of the respective marker image 
10 of said sample gels, such that the signals corresponding to spots of the same marker 

substance in the marker images of the first and the second electrophoresis gel are 
mapped on each other; 

- transforming the sample image of the first gel dependent on said mathematical 
relation, such that the sample images of both gels relate to the same marker 

15 substance positions; and 

- determining differences between the two electrophoresis gels by directly comparing 
the transformed sample image of the first gel with the sample image of the second 
gel. 

20 In another preferred embodiment the invention is a method for processing digital 
image data for a two-dimensional array of sample substance spots and landmark 
substance spots in an electrophoresis gel by using landmark substances having 
predefined properties, comprising the steps of: 

- defining ideal image data comprising co-ordinate data corresponding to ideal 

25 positions of the marker substance spots in said array dependent on electrophoresis 
conditions; 

- generating a marker image represented by marker image data comprising co- 
ordinate data corresponding to detected positions of said marker substances spots in 
the array; 

30 - determining a mathematical relation between the ideal image data and the marker 
image data, such that the co-ordinate data of said images are mapped onto each 
other; 

- generating a sample image represented by a sample image data set corresponding to 
detected signal values in the gel; 

35 - transforming said sample image according to said mathematical relation. 

In still another preferred embodiment the invention is a method for processing digital 
image data for two-dimensional arrays of sample substance spots and marker 
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substance spots in a plurality of electrophoresis gels by using external marker 
substances having predefined properties, the method comprising the steps of: 

- generating digital images represented by marker image data sets and sample image 
data sets comprising coordinate data and signal values corresponding to detected 
positions and signal values of said marker substance spots and sample substance 
spots in the arrays. 

- processing each said digital image to extract the marker substance data sets 
comprising coordinate data and signal values corresponding to detected positions and 
signal values of said marker substance spots in the arrays. 

- processing said digital images to determine a mathematical relation between the 
marker signals in said digital images such that said respective signals are mapped on 
each other. 

- generating sample images represented by sample image data sets comprising 
coordinate data and signal values corresponding to detected positions and signal 
values of said sample substance spots in the arrays 

- transforming said sample image signals according to said mathematical relation 

In still another preferred embodiment the electrophoresis gels contain a plurality of 
sets of, in relation to each other distinguishable, sample substances such that a 
plurality of sample images for each electrophoresis gel is acquirable. 

In still another preferred embodiment the step of generating said sample image 
comprises the step of scanning the array to form a pixel image, and wherein the step 
of transforming the sample image data signals comprises the step of transforming 
every pixel of said sample image into a transformed image, dependent on said 
mathematical relation. 

In still another preferred embodiment said array comprises two different marker 
substances having different properties, from which different properties co-ordinate 
data relating to ideal marker spot positions differing in at least one dimension for 
given electrophoreses operating conditions can be calculated. 

In still another preferred embodiment said array comprises a plurality of different 
marker substances having different properties, from which different properties co- 
ordinate data relating to ideal marker spot positions differing in two dimension for 
given electrophoreses operating conditions can be calculated. 



In still another preferred embodiment, dependent on the electrophoreses operating 
conditions, a set of marker substances is selected, comprising said plurality of different 
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marker substances, dependent on their corresponding co-ordinates of in the ideal 
image data. 

In still another preferred embodiment the step of normalising the sample image 
5 comprises the process steps of: 

- selecting a first pixel in the image plane defined by said ideal image data; 

- mapping said first pixel to the sample image; 

- reading the detected signal value for the mapped first pixel; 

- assigning said detected signal value to said first pixel in the image plane of the ideal 
10 image data; and 

- repeating these process steps for each pixel in the image plane. 

In still another preferred embodiment the step of reading the detected signal value for 
the mapped first pixel comprises the step of: 
15 - establishing a detected signal value for the mapped first pixel dependent on the 
signal value of at least one pixel in the sample image adjacent the mapped first pixel. 

In still another preferred embodiment the step of normalising the sample image 
comprises the process steps of: 
20 - selecting a first pixel in the sample image; 

- reading the detected signal value for said first pixel; 

- mapping said first pixel to the image plane defined by said ideal image data; 

- assigning said detected signal value to the mapped first pixel in the image plane of 
the ideal image data; and 

25 - repeating these process steps for each pixel in the sample image. 

In still another preferred embodiment the step of normalising the sample image 
comprises the process steps of: 

- selecting a first pixel in the image plane defined by said ideal image data; 
30 - reading the detected signal value for the mapped first pixel; 

- mapping said first pixel to the sample image; 

- assigning said detected signal value to said first pixel in the image plane of the ideal 
image data; and 

- repeating these process steps for pixels in the image plane defining an object 
35 consisting of sample substance spots and/or marker substance spots. 



In still another preferred embodiment the aforegoing further comprising the steps of: 
- determining the signal value for a first pixel in the ideal image plane dependent on 
the signal value of at least one mapped pixel in the ideal image plane adjacent said 
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first pixel in the ideal image plane; and 

- repeating this determination process for each pixel in the ideal image plane. 



In still another preferred embodiment the marker substances are non-protein 
5 compounds. 

In still another preferred embodiment said marker substance is an compound 
comprising a least a functional unit. 

10 In still another preferred embodiment said marker substance is an compound 
comprising a least a monomer unit and a functional unit. 

In still another preferred embodiment the detection of the landmark spots is done by 
monitoring the shapes of the landmark spots, both in two and three dimensions. 

15 

In still another preferred embodiment the detection of the landmark spots is done by 
monitoring the pattern(-s) of the landmark spots in the gel. 

In still another preferred embodiment the detection of the landmark spots is done by 
20 monitoring a comnbination of both shapes and pattern(-s) of the landmark spots in 
the gel. 

In still another preferred embodiment the detection of the landmark spots is done by 
monitoring the positions of the landmark spots in the gel. 

25 

In still another preferred embodiment the detection of the landmark spots is done by 
monitoring a combination of both shapes and positions of the landmark spots in the 
gel. 

30 In still another preferred embodiment the detection of the landmark spots is done by 
monitoring a combination of both positions and pattern(-s) of the landmark spots in 
the gel. 

In still another preferred embodiment the detection of the landmark spots is done by 
35 monitoring a combination of shapes, pattern(-s) and positions of the landmarks spots 
in the gel. 



Another aspect of the invention encompasses a computer program product for use in a 
computer system, comprising computer program code means for executing the 



WO 03/058230 ^) ^^CT/SE03/00029 

11 

computer system to process digital image data for a two-dimensional array of sample 
substance spots and marker substance spots in an electrophoresis gel, by performing 
the steps of: 

- accessing coordinate data for a marker image, relating to detected positions of 

5 marker spots of a predetermined substance in the array, dependent on electrophoresis 
operating conditions; 

- accessing coordinate data for another marker image, relating to detected positions of 
marker spots of said predetermined substance in the array; 

- determining a mathematical relation between said marker images, such that the 
10 coordinate data of said images are mapped onto each other; 

- generating a sample image represented by a sample image data set corresponding to 
detected signals in the gel; 

- transforming the sample image by transforming it dependent on said mathematical 
relation. 



15 



In a preferred embodiment of the computer program product it further comprises 
computer program code means for executing the computer system to perform the 
steps of any of the claims 1-27. 



20 Needless to say, this method can be used to compare plural gels, two by two. Since 
the markers are selected and have known properties, this transformation and 
subsequent comparison can be performed completely automatic for two or more gels, 
saving hours of work which as such constitutes a potential source of manual errors. 

25 BRIEF DESCRIPTION OF THE DRAWINGS 

The features of the present invention will be described in more detail below by means 
of preferred embodiments and examples, with reference being made to the appended 
drawings, in which 

Fig. 1 schematically illustrates a grid of co-ordinates relating to ideal positions of 
30 landmark spots in an electrophoresis gel for given electrophoresis conditions, 
according to an embodiment of the invention; 

Fig. 2 schematically illustrates a grid of co-ordinates relating to the detected positions 
of landmark spots in an electrophoresis gel for said given electrophoresis conditions, 
according to an embodiment of the invention; 
35 Fig. 3 schematically illustrates an image comprising the sample spots of the same 
electrophoresis gel, according to an embodiment of the invention; and 
Fig. 4 schematically illustrates a normalised image of the sample image in Fig. 3, 
according to an embodiment of the invention. 
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DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

As revealed above, the present invention relates to the method of using external 
markers in a gel electrophoresis process for normalising images from 2-D gels, making 
it possible to match sample spots between different images from 2-D gels, and to 
5 quantify sample spots in images from 2-D gels, in a more effective way than with the 
prior art techniques. 

Once a electrophoresis separation process has been performed on a gel comprising 
sample substances and landmark substances, these substances will be gathered in 

10 spots dependent on their properties, wherein each spot, or the centre thereof, 

corresponds to a certain parameter value pair in the two dimensions. The resulting 
pattern of spots will hence form a 2D array of more or less discrete spots. A suitable 
technique for capturing an image of this pattern is subsequently used, such that 
signals from the spots in the array are detected. Preferably the image is captured by 

15 scanning, but may also be captured by a single shot. The position of the landmark 
spots is then determined from the captured image. By using a digital scanner, the 
capturing of the image and the determination of the landmark spot positions may be 
performed in essentially the same step. From a single shot image, the spot positions 
may be determined by subsequent scanning of that image. 

20 

In order to detect the landmark spots, i.e. the positions of the landmark spots in the 
captured image, it is advantageous if the landmark substances are easily 
distinguishable from the sample substances. As described below this can be achieved 
in various ways. A suitable algorithm preferably is used to determine the co-ordinates 

25 for the centre of the spots for preferably all landmark spots in the gel. A data set of 
pairs of co-ordinates is thus formed, a detected landmark data set, representing 
signals in a detected image in an actual electrophoresis case. Each co-ordinate pair of 
the detected landmark image refers to a certain parameter value pair in the two 
dimensions of the 2D electrophoresis. One example of a landmark image, wherein the 

30 detected positions for the landmark spots are indicated, is shown in Fig. 2. 

Furthermore, a sample image is formed by capturing an image of the gel in which the 
sample spots are detected. As a person skilled in the art will realise, the landmark 
image and the sample image may be derived or formed from the same captured 
35 image, or from successive captured images. The sample image is digitally represented 
by a sample image data set comprising a signal value for each pixel in the captured 
sample image, or global image. This global image may or may not include the 
landmarks. Preferably, said signal value is expressed as a level of magnitude in a 
scale, such as a grey scale, relating to the exposure of the image in that particular 
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pixel. An example of a sample image of a separated electrophoresis gel is shown in 
Fig. 3, corresponding to tbe landmark image of Fig. 2. 



For different reasons, such as inconsistencies in the gel, the landmark substances in 
5 an actual electrophoresis run will rarely be located as would be expected from their 
known properties and the chosen electrophoresis conditions. This is one of the reasons 
why simple direct comparison between two electrophoresis cases is not always 
reliable, since this uncertainty naturally is equally valid for the sample substances. 

10 According to the invention, this problem is overcome by transforming the global image 
such that the external landmarks are positioned in the co-ordinates to which their 
properties in the separation processes correspond. This way, two or more 
electrophoresis gels ran under the same conditions may be independently transformed 
into a reference format, such that they may be compared to each other. The 

15 transformation feature of the invention is based on usage of known landmarks. One 
feature of using landmarks selected from substances of known properties is that when 
the conditions of the separation process of the electrophoresis are known, also the 
expected positions which these marker substances would ideally assume in the gel are 
known, or can be derived. By said conditions is here meant for instance the type, e.g. 

20 isoelectric point and molecular size, and range of parameters used to separate the 
samples in the gel, and the type of gel used. Consequently, once a set of landmark 
substances is chosen and certain electrophoresis conditions are selected, a pattern of 
discrete co-ordinates corresponding to the positions those landmark substances would 
assume, based upon their properties, can be derived. Accordingly, a data set of pairs 

25 of reference co-ordinates is thus formed, here denoted a reference landmark data set, 
representing signals in a reference image or ideal image relating to an ideal 
electrophoresis case. Each co-ordinate pair of the reference image refers to a certain 
parameter value pair in the two dimensions of the 2D electrophoresis. An example of 
an ideal marker image, corresponding to the landmark image derived from the 

30 detected landmarks as shown in Fig. 2, is illustrated in Fig. 1. By comparison between 
these two figures it will be obvious that the positions of the corresponding markers do 
not coincide perfectly. 

In order to be able to perform the aforementioned transformation, a mathematical 
35 relation between the reference landmark data set and the detected landmark data set 
is determined, such that the co-ordinates of the landmarks, or markers, in the two 
images represented by said data sets, i.e. the reference image and the detected 
landmark image, are mapped onto each other. The mathematical relation for this 
mapping is calculated from the position differences between corresponding co-ordinate 
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pairs in the respective sets, but at the same time the mathematical relation is 
continuous fo khe entire reference image. 

Each pixel of the transformed image is then created by collecting the value for the 
5 corresponding pixel in the detected global image according to the following basic 
procedure. A first pixel in the image plane of the reference image is mapped by said 
mathematical relation into the image plane of the detected global image. If th^ ;lobal 
image is distorted compared to what would have been expected under the present 
conditions, the co-ordinates of the mapped pixel will not be the same as the original 

10 pixel in the reference image plane. Once the mapped pixel co-ordinates have been 
calculated, the signal value of that co-ordinate is read in the sample image data set, 
and that signal value is thereafter assigned to said first pixel in the reference image 
plane. Each part of the reference image plane is then built up pixel by pixel to form a 
complete transformed image of the detected global image. 

15 Consequently, the entire detected image of samples is transformed by this mapping. 
The detected image is then globally transformed. An example of a such a globally 
transformed, normalised, image is illustrated in Fig. 4. In fact, the image in Fig. 4 is 
transformed from the detected sample image of Fig. 3, using a mathematical relation 
derived from the relation between the ideal and detected landmark images of Figs 1 

20 and 2, respectively, in accordance with an embodiment of the invention. All 

electrophoresis test items separated by this specific type of gel and strip, and having 
the corresponding set of landmarks, will in this globally transformed format have their 
landmarks placed on exactly the same positions or co-ordinates. A specific example of 
determining a mathematical relation for mapping the landmark images onto each 

25 other is described further below. 

In a preferred embodiment of the invention, external landmarks are used. Preferably 
these external landmarks are artificial molecules which are not proteins, as disclosed 
in the related patent application SE 0103103-8, which is hereby enclosed by way of 
30 reference. 

Several technical effects are obtained by the invention. The analysis of the detected 
image is automised in a higher degree than in the prior art solutions. By adding 
external knowledge, i.e. substances of known properties, the time-consuming manual 
35 placing of landmarks in the image, which is also a potential source of errors, in order 
to obtain a match with another image, is eliminated. Instead, each image is 
independently processed into a format in which it is comparable with any other 
processed image carrying the same landmarks. 



WO 03/058230 ^) ^^CT/SE03/00029 

15 

Using selected artificial landmarks further brings about the effect that landmark 
substances can be selected such that they form a desired pattern over the image, e.g. 
with substantially equal distance between each neighbouring landmark spots, resulting 
in a more even accuracy over the entire image and a more accurate transformation. 
5 The proteins functioning as landmarks in the prior art are spatially rather undefined 
and differing from time to time, which results in an uncertainty in the positioning in 
the landmarks from a spot. Detection of the artificial landmarks is more secure since 
the same amount of a certain landmark substance is always selected, wherein the spot 
will essentially always look the same. Therefore, the algorithm used will determine the 
10 co-ordinates in essentially the same way each time, rendering a more accurate 
positioning. 



According to an embodiment of the invention, the marker compounds suitable for gel 
electrophoresis are added externally, and are not chosen from the samples, as 
15 revealed above. The marker compound may be a polymer or it may be selected from 
the group consisting of glycoconjugates, carbopeptoides, polynucleotides, proteo- 
glycanes, fullerenes, carbohydrates and mixtures thereof. 

In specific embodiments, the marker compound is characterised by a pi of about 2-12. 
20 In still further embodiments, it is characterised by a pi of about 3-10. 

Further embodiments include marker compound characterised by a Mw of about 5-10 6 
Da. In still even further embodiments, the Mw may be of about 10 3 -10 5 Da, 5-1000 
Da, 5-600 Da, 5-250 Da or an otherwise suitable interval of molecular size due to the 
25 size of the samples to be separated and detected. Specific embodiments comprise 

marker compounds characterised by a pi of about 1-12 and by a Mw of about 100-10 6 . 

In one embodiment the marker compound is a dendrimer. Dendrimers are built up 
from different structural building blocks, some that may include branching points to 
30 achieve a tree like structure characteristic for dendrimers. The different building blocks 
are coupled together to achieve said treelike structure. 

The dendrimers offer two important features when used as marker molecules, namely 
a) the size can be easily modified by adding more layers and b) the feature of the 
35 dendrimeric compounds can be modified by adding different functional groups to the 
layers. Since the individual building blocks may be composed of repeatable units, the 
dendrimer molecule may be built up from only a few numbers of coupling steps, which 
is economically and technically advantageous. 
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According to the invention, the dendrimer may comprise at least one monomer, at 
least one functional group and optionally at least one core as building blocks. The 
dendrimer may be built up from separate building blocks as shown in figure 2 in the 
patent application application SE 0103103-8 referred to above. Said dendrimers may 
5 be built up either by a divergent strategy or by a convergent strategy as known in the 
art of dendrimer synthesis. 

In one embodiment, the dendrimer to be used as external landmarks according to the 
invention is synthesised according to the divergent strategy. 

10 

The dendrimer may be represented by the general formula 
(core) n (monomeri... 0 )x (functional group i... p) 

15 wherein n is an integer from 0-5 representing number of different co-existing optional 
cores, 

wherein o is an integer from 2-1000 representing number of different monomer 
building blocks within the monomer distributed over x layers, 
wherein x is an integer from 1-20 representing number of layers, and 
20 wherein p is an integer from 1-20 representing the number of different functional 
groups within one functional group building block. 

Other suitable dendrimers to be used as landmarks according to the invention are 
commercially available dendrimers. Though, such dendrimers lack at least one 
25 functional group according to the invention. Such a functional group may then, of 
course, be coupled onto the commercially available dendrimers using techniques 
known to the skilled man in the art. Examples of, but not limited to, commercially 
available dendrimers that may be used according to the invention are Astramol™ 
(DSM Agro, The Netherlands) and Starbust® (Aldrich) 

30 

In specific embodiments of the invention, synthetic amino acid dendrimers will be 
used. In still a specific embodiment, a diamin or a triamin is used as a core, 
diaminobenzoic acid as a monomer and aspartic acid as a functional group. By 
introducing different numbers of diaminobenzoic acid as a monomer a huge range of 
35 molecular masses may be achieved as exemplified in the table below. 
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Molecular size of dendrimers made of a 
diamin as core building block, 
diaminobenzoic acid as a monomer (mon) 
and aspartic acid as a functional group. 



mon Molecular size 



0 


290 


1 


788 


2 


1785 


3 


3779 


4 


7767 


5 


15743 


6 


31694 


7 


63597 



The dendrimer may comprise at least one core. According to specific embodiments, the 
5 core may include divalent, trivalent, tetravalent, multivalent cores and mixtures 
thereof. According to the invention, the at least one core may be selected from the 
group consisting of the formulas in figure 3 of SE 0103103-8, and mixtures thereof. 

In one embodiment, the core may be a diamine where n=2 or a triamine where n=l. 

10 

The core may contain further branching units, allowing the treelike structure 
characteristic of the dendrimer to form. 

An example of a dendrimer using diaminoethane as the core is shown in figure 4 of SE 
15 0103103-8. 

The dendrimer also comprises at least one monomer. The monomer may include 
further branching possibilities to the dendrimer molecule to achieve the treelike 
structure characteristic of the dendrimer. Due to the numbers of branching 
20 possibilities, the monomer may be monovalent, i.e., to elongate without branching, 
divalent, trivalent, tetravalent, multivalent or mixtures thereof. Examples of such 
monomers with monovalent and divalent branching units are shown in figure 5 of SE 
0103103-8. 

25 Specific embodiments use amid bondings between the different monomers. 
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In still a further embodiment, the at least one monomer may be 3,5- diaminobenzoic 
acid, as shown in figure 4 of SE 0103103-8. 



The number of monomers will contribute to the final molecular size of the dendrimer. 
5 According to specific embodiments, using 3,5- diaminobenzoic acid as monomer the 
monomer may be in a number of 1-10. 

The dendrimer may also contain a functional group according to the invention. The at 
least one functional group will to the dendrimer ad known characteristics. An example 
10 of known characteristics is, e.g. a desired net charge to the molecule. 

In specific embodiments, at least one functional group may be added to the free ends 
generated in the coupling step. Due to the addition of the at least one functional 
group, the landmark will according to the invention be able to position in the ge) if the 
15 gel parameters used for separation are selected so as to enable separation of the 
marker compound characteristics. 

According to different embodiments the at least one functional group may be selected 
from the group of functional groups shown in figure 6, and zwitterions, anionic or 
20 cationic, oligopeptides, alcohols, tiols, carboxy acids, amines; fluorochromes, such as 
fluorescamine, isotopes and mixtures thereof . 

In a specific embodiment, the fluorescamine used is the commercially available 
Fluram® available from Molecular Probes, U.K. 

25 

In still further specific embodiments, the at least one functional group is selected from 
the group consisting of an amino acid such as aspartic acid, glutamic acid. 

According to the invention, the marker compound has known characteristics affecting 
30 its migration in a gel during gel electrophoresis so as to position the marker compound 
in said gel. Such characteristics may, of course, be included by the addition of a 
functional group to the dendrimer. Even further, it may reflect the molecular size of 
the marker compound. 

35 By selecting landmarks with chosen characteristic, such as, e.g. pi and molecular size 
values, the external added landmarks may position in a desired way over the gel. Such 
information about characteristics of the external markers may be used for further 
analysis of unknown samples, such as unknown proteins. 
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According to the invention, a set of marker substances may form at least two marker 
spots in a gel after an electrophoresis run. As such, these spots will appear in different 
areas in the gel, due to the known characteristics of the marker compounds that form 
said spots, separated in at least one dimension. The number of marker spots may 
5 differ. This is due to the large number of different types of gels to be used in the gel 
electrophoresis step. Another factor that may affect the number of markers needed are 
the number of sample spots on the gel, which is obvious for the skilled man in the art 
of gel electrophoresis and separation of samples such as e.g., protein samples. Other 
factors that affect the number of spots needed are the number of separation 

10 dimensions. The invention is applicable to electrophoresis separation in at least one 
dimension and preferably two, but may be applied in separation in more than two 
dimensions. The number of marker spots needed may therefore be, such as about 0- 
50% of the number of sample spots, preferably 1-20%, e.g. 1-10% of the number of 
sample spots. Specific embodiments of the invention may use marker spots, wherein 

15 the at least two marker spots may be from about 2-1000 marker spots per said gel. In 
still further embodiments, the at least two marker spots may be from about 2-500, 
10-250, 10-100, or about 20-40 marker spots per gel. In an embodiment of a two 
dimensional gel, e.g. a polyacrylamid gel, in a size of about 24x18 cm, the number of 
marker spots needed may be about 9-40, e.g. 9-16 marker spots. 

20 

The grid formed by the distributed marker spots in the gel may be spatially even, but 
the marker set may also be selected such that the marker spots will appear unevenly 
distributed for various reasons. One such reason may be that is known or suspected 
beforehand that certain sample substances will be present in the gel and will gather in 
25 a specific region of the gel. It may then be desirable to select markers which will 

equally be gathered in the vicinity of this region, in order to have a sufficient amount 
of landmark readings in this region in order to improve the sample quantification 
accuracy. 

30 The marker spots appearing in a gel will contain different number of marker 

compounds per marker spot. However, the amount of marker compounds should be 
enough to allow a clear and concise detection of the marker spot on the gel. This is, of 
course, dependent on how the sample spots are detected, equipment and/or gel 
resolution. 

35 

According to specific embodiments of the invention, at least two marker compounds 
are used forming at least two marker spots which may position in the gel at ideal 
positions. As used herein, the word ideal is intended to mean ideal due to empirical 
information, such as experimental data, or from theoretical information, such as 
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chemical and/or physical data characterising the marker compounds used. Such 
compounds may be designed to be dependent upon e.g. pi and molecular size for their 
positioning. Of course the marker compounds may be designed in different ways, as 
obvious to the skilled man in the art, to depend upon other characteristics for 
5 separation according to other dimensions than pi and molecular size. 

The set of external landmarks may in specific embodiments comprise at least two 
marker compounds forming at least two spots, wherein the at least two marker 
compounds are characterised by a pi of about 1-12, such as about 3-10. In still further 
embodiments, the at least two marker compounds may be characterised by a Mw of 
10 about 100- 10 6 , such as about 10 3 -10 5 . Specific embodiments of the invention may 

include two marker compounds characterised by a pi of about 1-12 and a Mw of about 
100-10 6 . 

The application of a set of marker or landmark substances or compounds in a gel may 
15 include applying the set in the form of application strips or mixing and applying the set 
together with the test samples, or applying the set at the time of casting of the gel. 

When an electrophoresis separation has been made and the sample and marker 
substances have been positioned into spots in the gel, information about the position 

20 of each separated marker compound needs to be collected and stored. This 

information may be collected using any of the determination processes selected from a 
group comprising visual light, UV, IR, multispectral imaging, isotope labelling, 
colouring techniques such as silver staining, Comassie staining, and fluorescence 
techniques. The external marker substances are preferably selected such that they are 

25 distinguishable from the sample spots in the gel. This may be achieved by using 

marker substances having optical properties different from the optical properties of the 
sample spots, making it possible to detect the external markers in the gel. The marker 
spots are preferably detected by scanning the gel and picking up signals representing 
the positions of the spots using any of the techniques in the aforementioned group. 

30 The scanning procedure results in a digitized image, which is stored in a data memory. 

The stained gels, both containing separated landmarks and sample molecules, may be 
scanned in a scanner with dual detection possibility. With dual detection possibility, it 
is intended to mean that the scanner has the possibility to detect signals from 
35 different staining techniques, such as silver staining, fluorescent staining, 

radioactivity, or any other staining method used. In one embodiment, the landmarks 
are detected in a separate image, the marker image, after a scanning step, enabling 
detection of the landmarks positions only. Without changing the position of the gel, 
the parameters of the scanning apparatus are changed, so as to enable a separate 
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scanning and detection of the separated sample molecules, e.g. the proteins, in order 
to register a sample image. The two gel images, one with the separated landmarks 
and one with the separated sample proteins, now collected in digitalized form, are 
subsequently used for the image analysis of the gel. The gel may also be scanned 
once, detecting the sample molecules and the proteins in one image. This may be 
convenient when the same detection methodology is used for the sample molecules 
and the landmarks, but when the optical and/or the geometrical properties of the 
marker spots and the sample spots allow differentiation between them. 
The marker image is used for extraction of the co-ordinates for the spots of external 
markers in the image. This is done by standard techniques in mathematical imaging, 
and any of a number of methods could be used. The result is a file consisting of data 
representing the co-ordinates from the detected external markers in the gel. 
In accordance with the invention, the properties of the external markers according to 
the two separation parameters are known. For the set of external markers added to 
the sample prior to the electrophoresis a file of those properties has been established 
and stored in a file on a computer readable storage media, such as a hard drive, a CD- 
ROM or other data storage device. The differences between the known properties and 
the observed positions of the external markers in the gel are subsequently used to 
construct a mathematical relation mapping the one set of data onto the other. The 
relation should due to obvious reasons be injective and continuous over the whole 
defined image area. This mathematical relation could be defined and/or determined in 
any of a number of ways. The result is a mathematical function mapping the existing, 
detected, image of the sample spots, or the sample spots plus the marker spots, onto 
another image, the globally transformed image". Below, one example of an 
embodiment for carrying out the invention is described with reference to the 
accompanying drawings. 

A silver stained gel formed in an electrophoresis separation process is placed in a 
scanner and scanned in two different ways, or in a way that allows differentiating the 
marker spots from the sample spots, in order to extract the co-ordinates for the spot 
positions of the selected marker substances. The co-ordinates could typically be 
determined using segmentation of the marker spots, e.g. by using Watershed 
algorithm. A modified Gaussian distribution model of the marker spots may further be 
used to locate the centre of gravity for the spots and use the projection onto the x-y 
surface as the co-ordinates for the spots. The co-ordinate data thus extracted is stored 
in a file. The file of data for the set of detected spots of landmarks substances used in 
the image, and the file of the properties of said landmark substances, are loaded into 
a computer carrying a computer program devised in accordance with the invention. 
The computer program now has access to the observed data and the "known" data 
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relating to the landmarks. The known data on file may be specifically stated in the 
format of parameter pairs relating to the separation process used, but may also be 
stored in a form from which this value pair may be calculated dependent on 
electrophoresis conditions. 

5 

The co-ordinates for the ideal landmark positions are defined and accessible in the 
computer program as vectors named P x and P y . The co-ordinates for the detected 
landmark positions are defined and accessible in the computer program as vectors V x 
and V y . The x and y relate to the x-coordinates and the y-coordinates of the respective 
10 positions. 

Using the P x and the P y , all distances between the landmarks in the image plane of 
the sought normalised image are calculated and stored in a vector r. 
A function U(r) = r 2 * log r 2 is then calculated for all the r's, and stored in a matrix K. 
15 This matrix is thus a n*n matrix over the n landmarks in the following manner: 

U(r ln ) 
U(r 2n ) ^ 

0 

A matrix P is constructed: P = [ ] , that is a 3 * n matrix: [1 P x P y ]. 

1 * n y» 

20 Then a matrix or operator L is constructed according to: 
K P 

L = [ pT ], (n + 3) * (n + 3), where P T is the transpose of the P matrix and 0 is a 
3*3 matrix of zeros. 

25 Let V = (v x , v 2 , ... , v n ) be any n-vector, and write Y = ( V | 0 0 0) T , a column vector of 
length n + 3. A vector W = (w 1# w 2 , ... , w n ) and coefficients a lt a x/ a y are then defined 
by the equation L _1 Y = (W | ai a x a y ) T . According to the invention the elements of L~ 2 Y 
are subsequently used to define a function f(x,y) everywhere in the plane: 

30 

f{x, y) = a } +a x *x + a y *y + £ w,UQ P, - (x, y) |) . 

i=\ 



K = [ 



0 C/(r 12 ) 
C/(r 21 ) 0 
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The function f is divided into two parts: a sum of functions U(r) and an affine part. 



Let V = [ X] *? X " ] where each landmark homologous to (Xi, y,) in another copy 

of R 2 . Here, R represents the real numbers, and consequently R 2 represents the real 
number pairs in the plane. 

The application of L* 1 to the first column of V T specifies the coefficients of 1, x, y, and 
the U's for f x (x,y), the x-coordinate of the image of (x, y); the application of L* 1 to the 
second column of V T does the same for the y-coordinate f y (x,y). The resulting function 
f(x,y) = [f x (x,y), f y (x,y)] is now vector-valued: it maps each point (x { , yO to its 
homolog (Xj', y,'). Furthermore, this function is the least bent of all such functions, 
according to the measure I f , integral quadratic variation over all R 2 , computed 
separately for real and imaginary parts of f and summed. These vector-valued 
functions f(x,y) are the thin-plate spline mappings used in this method. If the pairings 
of points between the sets is in accordance with biological homology, the function f 
models the comparison of biological forms as a deformation. The whole procedure is 
obviously invariant under translation or rotation of either set of landmarks. 

The application of L" 1 to both the columns of V T thus specifies the coefficients for the 
transform between the to sets of landmarks. The coefficients are stored in the matrix 
"koeff". 

The coefficients are now applied to all the pixels in the sought normalized image, 
pixel-by-pixel, and the corresponding pixel value in the distorted image is determined. 

For this calculated pixel in the distorted image, the grey-level (0-255) is determined. 
To determine the grey-level for a non-integer pixel value, some sort of interpolation 
can be used. In our example a discrete bi-linear interpolation is used. The grey-level 
thus determined are appointed to the pixel in the sought normalized image. 

The next pixel in the sought normalized image is handled in the same way. In cases 
where the transform is mapping the pixel in the sought image to a pixel-value outside 
the defined area in the distorted image, the pixel in the sought image is set to 0 
(black). 
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5 



10 



When all pixels are transformed, and appointed a grey-level, the sought image is 
created and can be plotted and handled as a new image, corrected according to the 
information from the pair of landmark sets. 

Each point in the sought normalized image plane is mapped by a function 

(x, 7) (*' > y ) = (* + /* (*> y)> y + fy (x> y)) * 

where the f x and f y are the function based on £(-l)*t/(| (x>y) - D k |) as viewed in 3 
dimensions. 

Although several examples of embodiments have been described a person skilled in 
the art will realise then further modifications are possible within the scope of the 
appended claims. 



15 
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CLAIMS 

1. Method for processing digital image data for two-dimensional arrays of sample 
substance spots and marker substance spots in a plurality of electrophoresis gels by 
using marker substances having predefined properties, comprising the steps of: 

5 . generating marker images represented by marker image data sets comprising 
coordinate data and signal values corresponding to detected positions and signal 
values of said marker substance spots in said arrays; 

- processing said marker images to determine a mathematical relation between said 
marker signals in said marker images such that said respective signals are mapped on 

10 each other; 

. generating sample images represented by sample image data sets comprising 
coordinate data and signal values corresponding to detected positions and signal 
values of said sample substance spots in said arrays; 

- transforming said sample image signals according to said mathematical relation 

15 

2. Method for the detection, matching and quantification of sample substances in a 
plurality of electrophoresis gels by using external marker substances having 
predefined properties, the method comprising the steps of: 

- combining each of said gels with said sample substances and a plurality of said 
20 external marker substances each having a known identity and known characteristics 

that affect its migration in two-dimensional electrophoresis, which said external 
marker substances are selected such that they are distinguishable from said sample 
substances; 

- performing two-dimensional electrophoresis on said gels thus combined with said 
25 sample substances and said external marker substances to form a two-dimensional 

array, each array including said external marker substance spots corresponding to said 
external marker substances and sample spots corresponding to said sample 
substances; 

- scanning each array to form a first image of signals corresponding to said external 
30 marker substance spots and a second image of signals corresponding to said sample 

substance spots, the signals of said first image being distinguishable from those of 
said second image; 

- processing said first image together with the known identity and known 
characteristics of said external marker to determine a mathematical relation between 

35 said first image signals and signals representing said known identity and said known 
characteristics of said external marker substances such that said respective signals are 
mapped on each other; 

- transforming said second image signals dependent on said mathematical relation 
such that the external marker substance spots are positioned as external marker 
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reference spots on the co-ordinates in the two-dimensional array that correspond to 
their properties in said electrophoresis gel; 

- determining the properties of said sample spots dependent on said external marker 
reference spots in said transformed second image. 

3. Method for analysing samples substances by comparison between a first 
electrophoresis gel and a second electrophoresis gel by using external marker 
substances having predefined properties, the method comprising the steps of: 

- combining each of said gels with said sample substances and a plurality of said 
external marker substances each having a known identity and known characteristics 
that affect its migration in two-dimensional electrophoresis, which said external 
marker substances are selected such that they are distinguishable from said sample 
substances; 

- performing two-dimensional electrophoresis on said gels thus combined with said 
sample substances and said external marker substances to form a two-dimensional 
array, each array including said external marker substance spots corresponding to said 
external marker substances and sample spots corresponding to said sample 
substances; 

-scanning each array to form a marker image of signals corresponding to said external 
marker substance spots and a sample image of signals corresponding to said sample 
substance spots, the signals of said marker image being distinguishable from those of 
said sample image; 

- determining a mathematical relation between the signals of the respective marker 
image of said sample gels, such that the signals corresponding to spots of the same 
marker substance in the marker images of the first and the second electrophoresis gel 
are mapped on each other; 

- transforming the sample image of the first gel dependent on said mathematical 
relation, such that the sample images of both gels relate to the same external marker 
substance positions; and 

- determining differences between the two electrophoresis gels by directly comparing 
the transformed sample image of the first gel with the sample image of the second 
gel. 

4. Method for analysing samples substances by comparison between a first 
electrophoresis sample gel and a second electrophoresis gel by using external marker 
substances having predefined properties, the method comprising the steps of: 

- combining each of said sample gels with a common set of external marker 
substances each having known properties affecting its migration in two-dimensional 
electrophoresis; 
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- performing two-dimensional electrophoresis on each of said sample gels thus 
combined with said external marker substances to form a two-dimensional array, each 
array including external marker substance spots corresponding to said external marker 
substances and sample spots corresponding to said sample substances; 

5 -scanning each array to form a marker image of signals corresponding to said external 
marker substance spots and a sample image of signals corresponding to said sample 
substances spots, the signals of said marker image being distinguishable from those of 
said sample image; 

- applying a mathematical relation between the signals of the respective marker image 
10 of said sample gels, such that the signals corresponding to spots of the same marker 

substance in the marker images of the first and the second electrophoresis gel are 
mapped on each other; 

- transforming the sample image of the first gel dependent on said mathematical 
relation, such that the sample images of both gels relate to the same marker 

15 substance positions; and 

- determining differences between the two electrophoresis gels by directly comparing 
the transformed sample image of the first gel with the sample image of the second 
gel. 

20 5. Method for processing digital image data for a two-dimensional array of sample 
substance spots and landmark substance spots in an electrophoresis gel by using 
landmark substances having predefined properties, comprising the steps of: 

- defining ideal image data comprising co-ordinate data corresponding to ideal 
positions of the marker substance spots in said array dependent on electrophoresis 

25 conditions; 

- generating a marker image represented by marker image data comprising co- 
ordinate data corresponding to detected positions of said marker substances spots in 
the array; 

- determining a mathematical relation between the ideal image data and the marker 
30 image data, such that the co-ordinate data of said images are mapped onto each 

other; 

- generating a sample image represented by a sample image data set corresponding to 
detected signal values in the gel; 

- transforming said sample image according to said mathematical relation. 

35 

6. Method for processing digital image data for two-dimensional arrays of sample 
substance spots and marker substance spots in a plurality of electrophoresis gels by 
using external marker substances having predefined properties, the method 
comprising the steps of: 
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- generating digital images represented by marker image data sets and sample image 
data sets comprising coordinate data and signal values corresponding to detected 
positions and signal values of said marker substance spots and sample substance 
spots in the arrays. 

- processing each said digital image to extract the marker substance data sets 
comprising coordinate data and signal values corresponding to detected positions and 
signal values of said marker substance spots in the arrays. 

- processing said digital images to determine a mathematical relation between the 
marker signals in said digital images such that said respective signals are mapped on 
each other. 

- generating sample images represented by sample image data sets comprising 
coordinate data and signal values corresponding to detected positions and signal 
values of said sample substance spots in the arrays 

- transforming said sample image signals according to said mathematical relation 

7. Method according to claim 6, wherein the step of processing each image to extract 
marker substance data sets, comprises the process steps of: detecting landmark spots 
according one or more of the following criteria 

- the shapes of the landmark spots, both in two and three dimensions. 

- the pattern/s of the landmark spots in the gel. 

- a combination of both shapes and pattern/s of the landmark spots in the gel. 

- the positions of the landmark spots in the gel. 

- a combination of both shapes and positions of the landmark spots in the gel. 

- a combination of both positions and pattern/s of the landmark spots in the gel. 

- a combination of shapes, pattern/s and positions of the landmark spots in the 
gel.. 

8. Methods according to claims 1-7, wherein the electrophoresis gels contain a 
plurality of sets of, in relation to each other distinguishable, sample substances such 
that a plurality of sample images for each electrophoresis gel is acquirable. 

9. Method according to claims 1- 8, wherein the step of generating said sample image 
comprises the step of scanning the array to form a pixel image, and wherein the step 
of transforming the sample image data signals comprises the step of transforming 
every pixel of said sample image into a transformed image, dependent on said 
mathematical relation. 



10. Method according to claim 8 or 9, wherein said array comprises two different 
marker substances having different properties, from which different properties co 
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ordinate data relating to ideal marker spot positions differing in at least one dimension 
for given electrophoreses operating conditions can be calculated. 



11. Method according to claim 8 or 9, wherein said array comprises a plurality of 
5 different marker substances having different properties, from which different 

properties co-ordinate data relating to ideal marker spot positions differing in two 
dimension forgiven electrophoreses operating conditions can be calculated. 

12. Method according to claim 11, wherein, dependent on the electrophoreses 

10 operating conditions, a set of marker substances is selected, comprising said plurality 
of different marker substances, dependent on their corresponding co-ordinates of in 
the ideal image data. 

13. Method according to claim 8, wherein the step of normalising the sample image 
1 5 comprises the process steps of: 

- selecting a first pixel in the image plane defined by said ideal image data; 

- mapping said first pixel to the sample image; 

- reading the detected signal value for the mapped first pixel; 

- assigning said detected signal value to said first pixel in the image plane of the ideal 
20 image data; and 

- repeating these process steps for each pixel in the image plane. 

14. Method according to claim 13, wherein the step of reading the detected signal 
value for the mapped first pixel comprises the step of: 

25 - establishing a detected signal value for the mapped first pixel dependent on the 

signal value of at least one pixel in the sample image adjacent the mapped first pixel. 

15. Method according to claim 8, wherein the step of normalising the sample image 
comprises the process steps of: 

30 - selecting a first pixel in the sample image; 

- reading the detected signal value for said first pixel; 

- mapping said first pixel to the image plane defined by said ideal image data; 

- assigning said detected signal value to the mapped first pixel in the image plane of 
the ideal image data; and 

35 - repeating these process steps for each pixel in the sample image. 

16. Method according to claim 8, wherein the step of normalising the sample image 
comprises the process steps of: 

- selecting a first pixel in the image plane defined by said ideal image data; 



WO 03/058230 




T/SE03/00029 



30 



- reading the detected signal value for the mapped first pixel; 

- mapping said first pixel to the sample image; 

- assigning said detected signal value to said first pixel in the image plane of the ideal 
image data; and 

5 . repeating these process steps for pixels in the image plane defining an object 
consisting of sample substance spots and/or marker substance spots. 

17. Method according to claim 10, further comprising the steps of: 

- determining the signal value for a first pixel in the ideal image plane dependent on 
10 the signal value of at least one mapped pixel in the ideal image plane adjacent said 

first pixel in the ideal image plane; and 

- repeating this determination process for each pixel in the ideal image plane. 

18. Method according to claims 1-17, wherein the marker substances are non-protein 
15 compounds. 

19. The method as recited in any of the previous claims, wherein said marker 
substance is an compound comprising a least a functional unit. 

20 20. The method as recited in any of the previous claims, wherein said marker 

substance is an compound comprising a least a monomer unit and a functional unit. 

21. Method according to one or more of the preceding claims wherein the detection of 
the landmark spots is done by monitoring the shapes of the landmark spots, both in 

25 two and three dimensions. 

22. Method according to one or more of the preceding claims wherein the detection of 
the landmark spots is done by monitoring the pattern(-s) of the landmark spots in the 
gel. 

30 

23. Method according to one or more of the preceding claims wherein the detection of 
the landmark spots is done by monitoring a comnbination of both shapes and 
pattern(-s) of the landmark spots in the gel. 

35 24. Method according to one or more of the preceding claims wherein the detection of 
the landmark spots is done by monitoring the positions of the landmark spots in the 
gel. 
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25. Method according to one or more of the preceding claims wherein the detection of 
the landmark spots is done by monitoring a combination of both shapes and positions 
of the landmark spots in the gel. 

5 26. Method according to one or more of the preceding claims wherein the detection of 
the landmark spots is done by monitoring a combination of both positions and 
pattern(-s) of the landmark spots in the gel. 

27. Method according to one or more of the preceding claims wherein the detection of 
10 the landmark spots is done by monitoring a combination of shapes, pattern(-s) and 

positions of the landmarks spots in the gel. 

28. A computer program product for use in a computer system, comprising computer 
program code means for executing the computer system to process digital image data 

15 for a two-dimensional array of sample substance spots and marker substance spots in 
an electrophoresis gel, by performing the steps of: 

- accessing coordinate data for a marker image, relating to detected positions of 
marker spots of a predetermined substance in the array, dependent on electrophoresis 
operating conditions; 

20 - accessing coordinate data for another marker image, relating to detected positions of 
marker spots of said predetermined substance in the array; 

- determining a mathematical relation between said marker images, such that the 
coordinate data of said images are mapped onto each other; 

- generating a sample image represented by a sample image data set corresponding to 
25 detected signals in the gel; 

- transforming the sample image by transforming it dependent on said mathematical 
relation. 



30 



29. Computer program product according to claim 28, further comprising computer 
program code means for executing the computer system to perform the steps of any of 
the claims 1 - 27. 
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